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INTRODUCTION 



The aim of this paper is to propose a general technique for the analysis 
of structural data. It will be shown that three different approaches to 
analyzing structural data all have essentially the same solution, and 
also, as an added bonus, that the technique automatically give the 
"cliques" in the structure. The technique is then illustratively 
applied to data on the pattern of overlap in membership among voluntary 
activities in a high school. 

In considering the three different approaches, we will let W be a real 
symmetric matrix of relationships or sociometric choices. All values in 
W are sssumed to lie between zero and one inclusive. For example, we 
might let W. ■ l if i and J are friends and W.. • 0 otherwise. The 
elements on Che main diagonal are zero. We will now look at three 
approaches to devising measures of popularity or centrality in this 
structure. 

THE FACTOR ANALYSIS APPROACH 

The bonds that form in a group could be conceived of as the result of 
"interaction potentials" possessed by each individual. If W were a 
matrix of friendships, S. would be individual i's propensity to form 
friendships. The actual bond between i and J, Wj., should be close to 
S^Si, In matrix notation, we want to calculate a column vector S such 
that the sum of the squared differences between SS and W is minimized. 

W is like a peculiar correlation matrix with zero communalities and the 
criterion for S is identical to the criterion for the first principal 
components factor of a correlation matrix (minimizing squared differences). 
The vector S (and the first principal components factor) is the 
eigenvector of the largest eigenvalue of the matrix standardized so that 
its length is the eigenvalue. 

i 

An interesting byproduct of the factor analysis is that there is an 
exact parallel between the f Actor structure and the clique structure. 

Define a clique as a set of individuals such that no relationships 
extend outside the clique and all individuals within the clique are 
related directly or indirectly through chains of choices. Each clique 
will be represented in the factor structure by a factor all of whose 
elements are greater than or equal to zero (Appendix C). The members of 
the clique will have nonzero positive loadings on this factor and all 
those who are not members of that clique will have zero loadings 
(Appendix A). Thus, the clique structure can be read at a glances each 
clique will have its own vector of popularity scores, 

The individuals who load high on a factor all of whose elements are 
greater than or equal to zero will be especially popular in their 
clique. For each clique the magnitude of the eigenvalue of its 
eigenvector of popularity scores will be a measure of how good the 



eigenvector is in summarizing the relationships in the clique. In a 
factor analysis of a correlation matrix these eigenvalues tell how 
much of the variance each factor accounts for. 

It is well known that W is factorable even when it is not symmetric. 

If W is asymmetric it is possible to factor it into a row and a column 
factor whose product is a least squared error estimate of W (Wright and 
Evitts, 1961). The row factor is an eigenvector of WW' and the column 
f sc tor is an eigenvector of W'W. W is assumed to be symmetric in this 
paper because then there is the necessary convergence between the three 
approaches. If W were not symmetric the weighting techniques to be 
described next may have no real number solution at all. 

THE CONVERGENCE OF AN INFINITE SEQUENCE 



A simple measure of poularity is Just the number of friends each person 
has. It might be desired to create second order indices of popularity 
by weighting eoch person's choice of others by the number of choices he 
receives. But there is no reason to stop at this point. If the second 
order indices are better than the first order indices (the number of 
friends)! then third order indices in which each person's choice of 
others is weighted by his second order measure of popularity should be 
even better. 

Let be a column vector of ones. Then Sp the first order popularity 
measure! is Just ■ WSq. The second order measure is S2 " WS^ * V‘Sq. 
The mth order measure is S m ■ W^Sq. 

These popularity measures tend to become infinitely large. However, 
there is a small modification that does allow S to converge to a set 
of nonzero popularity scores as m approaches infinity. Let A be 
the highest eigenvalue of W. At each stage divide the result oy the 
largest eigenvalue. S_ • WS_.|/A 1 - W m S 0 /A™. S converges to an 

eigenvector of A. (Appendix B) . Thus! the solution is almost 
Identical to the factor analytic solution. The final popularity scores 
are described by an eigenvector of the largest eigenvalue. This is 
equivalent to the most powerful factor in a factor analysis, if the 
eigenvector is standardized so that its length is the eigenvalue. How* 
ever, if there is more than one clique the limit of the S scores will 
only give the popularity scores for the clique with the largest 
eigenvalue, which will tend to be the largest clique; entries for all 
nonmembers of the clique, including both Isolates and those who belong 
to smaller cliques, will be tero. Thus, this approach is a little less 
general than the other two. 

SIMULTANEOUS LINEAR EQUATIONS 

Suppose we look at the limit of the previously described process. We 
want to weight each person's contribution to the popularity of others 
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by his own popularity. We can net up a system of homogeneous linear 
equations for the unknown popularity scores. For each Individual 1: 



- w llSl 



+ W£2®2 + 



• w m s n 



Each popularity score should be positive or zero. 

This system of equations is matrix form is S ■ WS, or (W-I)S “0, where 
the vector S is the unknown popularity scores. These equations have a 
nonzero solution only under the unlikely condition that det(W-I) ■ Q. 

We would modify the equations above by multiplying the left hand side by 
a constant A . This modification does not violate the spirit of 

the model and it allows a solution to the equations. 



A S. 



W il S l 



••• +W in S n 



Then the equations in matrix form are WS ■ AS, or (W- AI)S ■ 0. This 
is the familiar problem of finding eigenvalues and eigenvectors. A 
is an eigenvalue and S if an eigenvector. 



Although every W will have eigenvalues and eigenvectors, the question 
remains as to whether there is a desirable solution. A should be 
positive and each S { should be positive. There should be just one 
solution; we should^ot be faced with an arbitrary choice of possible 
eigenvectors. In Appendix C it is demonstrated that each clique will 
have a positive A such that all elements of its eigenvector are 
greater than or equal to zero. Moreover, Appendix A shows that these 
solutions do not contradict one another. 



Thus, all three approaches have the same solution. The solution is 
always the eigenvector of the largest eigenvalue for each clique. There 
are minor differences. In the factor analysis approach the eigenvectors 
are standardised so that their lengths are their eigenvalues. The 
limiting process gives a solution only for the clique with the Isrgest 
eigenvalue (which makes no difference if there is just one clique). 

The chief difficulty will be with the assumption of symmetry. Some 
sociometric structures are naturally symmetric or can reasonably be 
made so, and the technique can be applied directly. For example, being 
friends might be defined so that if they do not choose each other they 
are not friends. Talking to one another, spending time together, dating, 
and other behavioral relations are naturally aymnetric. An index of 
centrality in a coonunicatlon structure might be desired, as a clue 
to how fast information will spread from an individual or to how much 
power an individual has because of his key position. This structure 
will be symmetric if the communication channels are two*way. 

However, other relations will lnalterably esysnetrlc, such as nominations 
for the most powerful or the most popular in a group. A plausible agrument 
can be given for making a matrix of nominations sya*trlc. (W4V')/2 is a 



symmetric matrix and it could be used instead of W. One reason for 
giving the nominations of the more powerful or popular more weight is 
that they may have better knowledge of what the status system is. If 
this is true then it is consistent to let status be determined in part 
by the statuses of those one nominates as well as the statuses of one'u 
nominators, and the analysis of (W-W')/2 will do this. 

HUBBELL 



The closest approach to that presented here is Katz's (1953) method 
of calculating status indices and Hubbell'a (1965) method of clique 
identification. Since Hubbell's technique includes Katz's, only Hubbell's 
will be discussed here. 

Hubbell's basic equation for status scores is (1965: 382): 

S - E + WS 

S is a column vector of status scores. W is called by Hubbell a 
"structure" matrix. All values of W are less than or equal to one in 
absolute value. E is a vector of "exogenous contributions" to status in 
the system, aspects of a person's status that are not reflected in his 
nominations by other group members. Each individual's status is the 
sum of his nominations by others weighted by their status plus contri- 
butions of other factors (represented by e^) . 

There is no discussion in the article of how the e^'s are to be 
Independently determined. It is suggested that in the absence of 
external evidence E might as well be a vector of I's. Thus, it seems 
fair to say that with sociometric data, in which W is given but not 8, 

E is a mathematical convenience that turns a set of unsolvable homo- 
geneous equations ((W-I)S*0) in to a set of solvsble nonhomogeneous 
equations ((W-I)S*B), Just as X was introduced as a convenience. 

It is easy to see that tiibbell's technique is very similar to the 
technique proposed in this paper. What are (.heir relative advantages 
and disadvantages? 

1, Hubbell'a approach requires the arbitrary assumption of some 
E vector, this appears to be true of the technique when it 
is applied to sociometric choice matrices rather than to the 
economic problems it was designed for. On the other hand, 

X in the "factor analysis" approach is not arbitrarily 
supplied by the researcher; it comes from the data itself. 

2. The solution vector S in the factor analysis model has a data 
reduction interpretation. SS' is a least squared error 
estimate of W. This is not an Interpretation that fits Hubbell's 
model. 
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3. Cliques can easily be Identified using the factor analysis 
approach. Kubbell produces a refined measure of the relation- 
ship between every pair of group members , the 6um of all the 
direct and indirect paths between them, but cliques must still 
be grouped together "by hand." 

4. Hubbell's technique has a nice interpretation that the factor 
analysis approach lacks. The elements of S are the sums of all 
the direct and indirect paths between a given individual and 
all other individuals. 

5. Hubbell's approach permits negative values in the matrix Vf and, 
most importantly, it does not require that tf be symmetric." 

This summary was not meant to favor Hubbell's approach or the factor 
analyaia approach but only to show that they had different advantagea 
and disadvantagea. 

OVERLAPPING GROUPS AND CENTRALITY 




Instead of applying the technique to matrices describing friendships or 
communication patterns between individuals, a sociologically relevant 
but not often examined kind of data will be analysed. This is the 
pattern of overlap in membership between groups, a relationship between 
groups rather than between individuals. Data about overlbp are often 
easily available and they can be important. . 

In a structure of overlapping groups "centrality" is analogous to 
popularity. Hie centrality of a group is related to the extent of its 
overlap with other groups, but it Is aUo affected by the centrality 
of the groups it overlaps with; a group is more central if it overlaps 
with central groups than if it overlaps with noncentral groups. 

There are many instances in which centrality is Important. Consider 
the following! 

1. In studying the spread of rumors and other communications in 
a system of groups, the centrality of a group in the pattern 
of overlapping memberships could be a clue about how rapidly 
information in a given group will be transmitted to the social 
system as a whole. Information should spread especially fast 
from groups that overlap with other central groups. 

2. In examining the pattern of Interlocking directorates among 
the largest corporations, centrality might be a clue about 
Influence. The directors of central corporations would have 
more widespread and extensive contacts than the members of less 
central corporations not only because they belong to more 
boards but also because these other boards would also contain 
active directors with many contacts. Centrality might also 
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bear a relation to influence in the pattern of voluntary 
organisations in a snail community. 

3. In a high school central activities might tend to be those 

with the highest status in the high school. Central clubs and 
activities would be those that active students belonged to. 

In calculating centrality indices, each overlap of a group with another 
group is weighted by the centrality of the other group. Thus, the 
calculation of centrality indices is identical to the calculation of 
popularity indices. For each isolated set of groups (analogous to a 
clique) the eigenvector of the largest eigenvalue of W is the desired 
vector of centrality scores. 

A MEASURE OF OVERLAP BETWEEN GROUPS 



We need a measure of overlap between groups that is standardized so that 
the sheer sizes of the two groups does not by itself affect the measure. 
Just as the product moment correlation coefficient is the covariance 
standardized so that it is unaffected by the variance of the two 
variables. The ideal measure of overlap would take the value .00 if 
there were no overlap and 1.00 if there were the maximum possible 
overlap between the two groups. It would take some standard value, say 
.05, if membership in the two groups were statistically independent. 
Standard measures of association are Inadequate because they are zero 
if there is statistical Independence and negstive if there is no over- 
lap. We want the measure to be positive in the former case (there is 
some overlap) and zero in the latter. 



The following measure r is one among many possible standardized 
measures of overlap that have the desired properties. Let c be the 
value r should have when the amount of overlap between groups A and 
B is the amount expected if membership in the two groups were inde- 
pendent. Let u. be the expected overlap (if membership in the two 
groups were Independent) divided by the maximum possible overlap, and 
let t.. be the actual overlap between groups A and B divided by the 
maximum possible overlap. Define r ^ as follows! 



r. 



r AB " 



log c 
log u^g 



AB 



when A 4 B 



r AA " 0 

It can easily be checked that this sttasure has the required properties. 
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DATA 



The pattern of overlap among a set of voluntary student activities in a 
high school was examined. A high school year-book was used as a source 
of data. The year-book that was used was not selected in any systematic 
way; it happened to be easily available. The high school was Menlo- 
Atherton High School in Atherton, California and the year was 1956. 

This makes it more comparable with Gordon's (1957) study, with which it 
will be compared, than if a more recent yesr-book had been used. 

No additional data outside the year-books were collected. There is no 
data on student attitudes toward the groups or about the informal social 
system in the high school. Without other scurces of data as a check 
the technique can not be validated; it has only face validity. However, 
it will be shown that the results of the analysis correspond to common 
sense expectations about high schools. 

The eigenvectors and eigenvalues of the matrix of standardized overlaps 
were computed. All the entries in the eigenvector with the largest 
eigenvalue were strictly positive. Therefore, all activities were 
connected directly or indirectly and this eigenvector gives all the 
centrality scores. The list of activities from the most central to 
the least central is as follows. 

(Table 1 about here) 




This list is consistent with common sense expectations, for the most 
part. The sole entrance criterion of the most central activity was 
that the student participate in many school activities. Activities 
with junior and seniors are more central than equivalent activities 
with freshmen and sophomores. For example, the Junior and Senior 
Boards are more central than the Freshman and Sophomore Boards , and 
the varsity sports are all higher than their frosh-soph counterparts. 
Among the varsity sports basketball, football, and baseball are the 
highest and they are followed by track, swimming, waterpolo, tennis, 
and wrestling; the major sports score higher than the minor sports. 

The student newspaper is more central than the yearbook staff. Clubs 
that do not represent the school nor perform a function for the school 
as a whole but merely satisfy members 1 private Interests (model airplane 
club, astronomy club, archery club, Future Teachers of America, etc.) 
tend to score low. 

One interesting feature is one of the two science-related clubs, the 
astronomy club, scores very low, while the other, the radio club, scores 
in the middle. It might be interesting to see if the status of science 
clubs has changed since 1956. 

Centrality seems to be related to status or prestige. In Gordon's 
(1957) book, The Social Structure of the High School , a summary of 
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student evaluations of the prestige of SO student activities is given. 

If it is assumed that prestige is related to centrality and that the 
prestige ranking of activities in Gordon's high school and the high 
school under study are similar, then the centrality measures and the 
prestige ratings ought to be positively associated for those activities 
that were in both high schools. Not only should it be positively 
correlated. The centrality indices, which are weighted sums of overlaps, 
should be more highly correlated with prestige than are the unweighted 
sums, the row sums of the matrix of overlaps, if anything has been 
gained by weighting. 

There were 21 activities that existed in both high schools. The 
Spearman rank order correlation between prestige (from Gordon's data) 
and the unweighted amount of overlap (the row totals) is .41. The 
correlation of the prestige scores with the centrality measures is .49. 
This is a small difference, but it is in the right direction.*’ 

In comparing the two lists, the centrality scores and Gordon's status 
scores, there seems to be one systematic difference. Sports are higher 
in Gordon's list, especially the major sports. This could reflect a 
difference in the high schools. Gordon's high school was in a small 
Mid-western town and Menlo-Atherton is located in a wealthy San 
Francisco suburb. 

In the systems we are examining it is assumed that there are two 
different types of groups: membership groups which are real, which 

meet, etc., ;uid categories, which never function as groups and which 
the membe: a nay not even be aware of. The centrality of a category is 
the centrality of the groups its members belong to. The centrality 
scores for categories will be a weighted combination of their overlap 
with the groups in the system, where each overlap is weighted by the 
centrality of that group. Students were categorized from the yearbook 
according to their year in school, sex, and race. It was found that 
centrality Increased from the freshman to the senior years, that females 
were more ceutral than males, a tendency that increased with the years, 
that Negroes were very low in centrality, but that their centrality also 
Increased with the years. These common sense findings support the 
technique* 

The finding that females were more central than males, especially in 
the senior year, is interesting. It could be that high school was more 
of a high point in the life of females than males. From high school 
many of he boys would rise in status through attendance at college and 
through their occupation. For many of the girls their independent life 
would end soon after high school in marriage, and so females may have 
directed more energy to their "last chance" for individual accomplishment 

EDUCATIONAL APPLICATIONS 

In this paper the technique was used to devise centrality measures for 
voluntary activities in one high school. Centrality appeared to be 



related to status. This measure of the status of activities could be 
used in conjunction with ot'ier measures or it has the advantage of 
being usable even if one has only year-books. 

Some interesting possible applications of the model would involve com- 
paring high schools. For example, how hau the centrality of the various 
activities changed with time? A study of which groups have risen and 
which groups have fallen in centrality-status would be interesting. Have 
sports risen or fallen in centrality? Have science clubs risen or 
fallen in centrality in the past decade? 

Another approach is to study the relationship between how much an 
Individual participates in "central' 1 organizations and the probability 
that the student goes to college. The correlation between the centrality 
of the organizations an individual participates in and whether or not 
he goes to college could be used as a way of characterizing high schools; 
some high schools might be more effective "launching pads" than others. 

CONCLUSIONS 



There are situations in which one wants to count the number of 
relationships that an individual has: in order to measure popularity, 

or power, or centrality in a communication system as examples. There 
are ways of modifying and perhaps Improving this simple operation. One 
approach is to calculate. a (column) vector of "interaction potentials" 
such that SS 1 is a close approximation to the matrix of relationships; 
S^S j is close to the relationship between individuals i and j. A 
se.cond approach appears to be quite different. Instead of simply 
addiug up the number of relationships each person has one might wish to 
weigh these relationships, because a relationship with a more central 
or popular individual contributes more to ones own popularity or 
centrality. In the paper two somewhat different approaches toward 
calculating these weights are described. 

In asymmetric matrices "factoring" and 'Weighting" have different 
solutions (illustrated by Wright and Evitts, 1961, and Hubbell, 1965, 
respectively) . However, when W is symmetric certain eigenvectors of 
W are both "factor" solutions and "weighting" solutions. A solution 
vector S can be Interpreted both as a least squared error reduction of 
the matrix of relationships and as a set of weights for relationships. 
Moreover, cliques are easily identified. 

The pattern of overlaps among a set of groups would seem to be a 
distinctively sociological object of study. When applied to data on 
the pattern of overlapping memberships among a set of groups, this 
technique gives a solution vector that is the best (least squared error) 
set of measures of the tendencies of the groups to overlap with each 
other and is also a set of weights so that each group's centrality is 
affected by the centrality of the groups it overlaps with. 
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The pattern of overlapping relations among a set of voluntary activities 
In a high school was examined. It was suggested that centrality would 
be a convenient measure of the status of these activities In a high 
school was examined. It was suggested that centrality would be a 
convenient measure of the status of these activities that could be 
derived solely from year-books. 
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TABLE l 



Centrality Measures 



1. Golden Key Society .291 

(honorary club for active students) 

2. California Scholarship Federation .253 

(honorary club for students with high GPA 

3. Senior Board .235 

(legislature of senior class, elected 

4. Girls Block Society .228 

(honorary athletic society 

5. Junior-senior student council .227 

(student legislature, elected 

6. "A" student court .226 

(tries violators of student conduct rules) 

7. French Club .213 

8. Cultural Board .212 

(planned school entertainment) 

9. Student newspaper staff .205 

10. Spanish Club .200 

'll. Girls Association .197 

(planned events relevant to girls) 

12. Girls Athletic Board .171 

(promoted and planned lntrmural girls ' sports) 

13. Publications Board .171 

(supervised student publications) 

14. A Capella . 169 

(choir) 

15. Board of Welfare .166 

(cited violators of conduct rules) 

16. Yearbook staff .165 

17. Social Board .162 

(planned school dances) 

18. Literary magazine staff .147 

19. Boys Block Society .143 

(honorary athletic soclecy) 

20. Chorallers .139 

21. School Spirit Board , .133 

(staged rallies 

22. Pom Pom girls .128 

23. Cub Staff .115 

(assistants to newspaper staff) 

24. Junior Board .114 

25. Forensics .114 

(debating club) 



26. Boys Athletic Board .109 

(exercised student control over intramural 

and extramural athletics) 

27. Finance Board .104 

(raised money for student activities) 

28. Varsity basketball .100 

29. Varsity football .099 

JO. Band .095 

31. Audio visual crew .084 

32. Radio Club .079 

33. Varsity baseball .077 

34. Varsity track .076 

35. "B" student court .075 

36. Players Club .074 

(planned and acted school plays) 

37. Golf .073 

38. Varsity swimming .069 

39. Cheerleaders .067 

40. Dance band .067 

41. Varsity water polo .066 

42. Orchestra .053 

43. Publicity Board .046 

(publicized school events) 

44. Junior Statesmen .043 

(attended annual state convention patterned 

after state government) 

45. "B" basketball .036 

46. Tennis .033 

47. Future Teachers .030 

48. Freshman Board .030 

49. Frosh-soph water polo .028 

50. Frosh-soph student council .027 

. 51. Archery Club .026 

52. "B" swimming .024 

53. Majorettes .023 

54. Frosh-soph football .023 

55. Wrestling .022 

56. Red Cross .021 

57. Sophomore Board .020 

58. "C" swimming .019 

59. "C" track .018 

60. "B" track . .016 

61. "D" basketball .015 

62. Frosh-soph basketball .013 

63. "C" basketball .010 

64. Astronomy Club .008 

65. Model Airplane Club .001 
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Appendix A 



In Appendix C it is shown Chat chere is associated with each clique 
an eigenvector all of whose elements are greater than or equal to zero. 
Here we wish to show that if an eigenvector of the matrix W has all 
positive elements the relationship between any of the individuals with 
strictly positive entries and any of the individuals with zero entries 
will be zero, and thus the eigenvectors with all positive or zero elements 
separate the system into sets of individuals who are not related to each 
other. 

1. We have the following system of equations: (W-AI)S ■ 0, where 

Wii • 0, W » 0 for every i and J. 

2. Suppose that a solution exists for which > 0, Sj ■ Sj ■ . . . 

■ S ■ 0. An examination of the above equations shows that this 
implies that W,, ■ W 3l ■ . . . ■ W fll * 0. The first group does 
not overlap with any other group. 

3. Suppose that a solution exists for which S. > 0, S» > 0, . . . 

V°- Vt ' S k« " • • • - S n ' °> “ he " k » 2 ‘ 

The last n-k equations now express n>k relationships among the 
first k (strictly positive) status scores. All the relationships 
sum to zero and the k(n-k) coefficients are all the relationships 
between the first k and the last n-k individuals. If any of the 
relationships were greater than zero others would have to be 
negative because every sum is zero and the first k status scores 
arc strictly positive. But no relationships are negative. 

Therefore All of these relationships W^, i^k, J>k, are zero. 



Appendix B 



He wish to show that a sequence of status scores converges to an 
eigenvector of the largest eigenvalue of W. 

Let X, be the eigenvalue of W that Is greatest in absolute magnitude, 
assume 1 it is of multiplicity one, and that it is strictly greater in 
absolute value than any other eigenvalue. The qualifications will 
almost certainly be true for any actual matrix W. Let {u. , u r } -V 

be a set of orthonormal eigenvectors for W, where u^ is the eigenvector 
of X x • The eigenvalues of W are real and the eigenvectors are orthogonal 
because tf is symmetric. It is shown in Appendix C that X. is positive 
and that every element of u^ is greater than or equal to zero (or else 
every element is less than or equal to zero - it makes no difference). 

Let S. be a column vector of ones. Because the orthonormal eigenvectors 

of W are a basis for the vector space, Sq • £ Ciu^. Multiplying through 

by u, 1 , 0^ ■ u. 'S q >0, because every element or u. is greater than or 

equal to zero but at least one element is nonzero. Define S ■ WS ./A. . 

m m-i x 

S -W m S/Xi *W®ECu/Ai " EC ( Xj/ Xi ) m u -V 

mO- 1 - i i 1 i 1 1 i ' 



■ c i u i 

The last term approaches zero as m increases. Therefore, approaches 
C u , which is also an eigenvector of X^ because ^'o» 

If the division were by any number smaller than X^ in absolute value 
S would diverge. If division were by any greater number S m would 
converge to a zero vector. If division were by a number of the same 
magnitude but of opposite sign, S would oscillate. 
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Appendix C 



We want to show that for each clique there exists a positive eigenvalue 
with an eigenvector all of whose elements are greater than or equal to 
zero. However, the eigenvalues and the eigenvectors of the matrix W 
are Just the eigenvalues and the eigenvectors of the aubmatrices 
accociated with each clique. Therefore, all that will be shown is that 
the eigenvalue of W that is largest in absolute magnitude is also positive 
and that it has an eigenvector all of whose elements are greater than or 
equal to zero. Then it will follow that an eigenvector of the largest 
(positive) eigenvalue of the submatrix associated with each clique has 
elements all of which are greater than or equal to zero. 



1. 



2 . 



Let W be a symmetric matrix all of whose elements are greater 
than or equal to zero. Let { X., Xg, •••* X n ) be its eigenvalues 
and let { u., u 2 u n > Be a cotresponding set of ortho- 

normal eigenvectors. The eigenvalues of W are real and the 
eigenvectors are orthogonal because W is symmetric. Assume, as 
in Appendix B, that X.^ is greater in absolute value than any 
other eigenvalue. 



u ll 

U 2l 



nil 



Assume without loss of generality that u^ ♦ 0. 



3. 



Let 



fl 

0 






Because the eigenvectors of the symmetric matrix W span the 
vector space, 2. C|Uj “ x. Multiplying through by up c i a 
u^'x ■ u^ +0 

4. Define x^ a s the first column of the rath power of the matrix W. 
Because every element of W is positive or zero, it is clear that 
every element of W n Is also positive or zero, and hence every 
element of x,,, is positive or zero. 

5. Xjjj ■ W m x ■ tf™ E CjU£ " EcjW m u^ ■ E Cj A^Uj. 




c l VS + i | 2 c i \S * u ll X J u l + i| 2 C l X iS 



IS 



- 2 - 



6. We know that X + 0, + 0. Therefore, 

V (u n > ■ “l + (l/ “u> i\z c i< *1 ' A 1 >\ 

As m approaches infinity, the last term approaches zero because 
X^ is greater in m absolute value than any other eigenvector 
Therefore, ^/(u^ ) approaches uj. All elements of ^ 

are positive or zero. Therefore, all elements of u, are greater 
than or equal to zero or they are less than or equal to zero, 
depending on the sign of o,.^" . If u, is an eigenvector, so 
is -up Thus \ has an eigenvector with all elements greater 
than or equal to zero. 

7. Moreover, X > 0, because if X^ were negative, ^/(unAj 0 ) 
would oscillate from positive to negative values, Instead of 
approaching the nonzero vector u^. 
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Appendix D 



Characteristics of students in 
Menlo-Atherton High School in 1956 



Year in school 



Freshmen 565 

Sophomores 454 

Juniors 421 

Seniors 379 

Sex 

Male 929 

Female 890 



Race (identified from yearbook pictures) 

Caucasian 1748 
Negro 28 
Oriental 43 
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